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OECD GUIDELINE FOR THE TESTING OF CHEMICALS 


Hershberger Bioassav in Rats: A Short-term Screening Assay for (Anti)Androgenic Properties 
INTRODUCTION 

1. The OECD initiated a high-priority activit>- in 1998 to revise existing guidelines and to develop 
new guidelines for the screening and testing of potential endocrine disrupters (1). One element of the 
activity was to develop a Test Guideline for the rat Hershberger Bioassay. After several decades of use by 
the pharmaceutical industry’, this assay was first standardized by an official expert committee in 1962 as a 
screening tool for androgenic chemicals (2). In 2001-2007, the rat Hershberger Bioassay has undergone an 
extensive validation programme including the generation of a Background Review Document (23), 
compilation of a detailed methods paper (3), development of a dissection guide (21) and the conduct of 
extensive intra- and interlaboratory studies to show' the reliability and reproducibility of the bioassay. 
These validation studies were conducted w'ith a potent reference androgen (testosterone propionate (TP)), 
tvvo potent synthetic androgens (trenbolone acetate and methyl testosterone), a potent antiandrogenic 
pharmaceutical (flutamide), a potent inhibitor of the synthesis (finasteride) of the natural androgen 
(dihydrotestostcrone-DHT), several weakly antiandrogenic pesticides (linuron, vinclozolin, procymidone, 
p,p' DDE), a potent 5a reductase inhibitor (finasteride) and tw'O known negative chemicals (dinitrophenol 
and nonylphenol) (4) (5) (6) (7) (8). This Test Guideline is the outcome of the long historical experience 
with the bioassay and the experience gained during the validation test programme and the results obtained 
therein. 

2. The Hershberger Bioassay is a short-term in vivo screening test using accessory tissues of the 
male reproductive tract. The assay originated in the 1930's and was modified in the 1940’s to include 
androgen-responsive muscles in the male reproductive tract (2) (9-15). In the 1960s, over 700 possible 
androgens w'ere evaluated using a standardized version of the protocol (2) (14), and use of the assay for 
both androgens and antiandrogens was considered a standard method in the 1960s (2) (15). The current 
bioassay is based on the changes in weight of five androgen-dependent tissues in the castrate-peripubertal 
male rat. It evaluates the ability of a chemical to elicit biological activities consistent with androgen 
agonists, antagonists or 5a-reductasc inhibitors. The five target androgen-dependent tissues included in this 
Test Guideline are the ventral prostate (VP), seminal vesicle (SV) (plus fluids and coagulating glands), 
levator ani-bulbocavemosus (LABC) muscle, paired Cowper’s glands (COW) and the glans penis (GP). In 
the castrate-peripubertal male rat, these five tissues all respond to androgens with an increase in absolute 
weight. When these same tissues are stimulated to increase in weight by administration of a potent 
reference androgen, these five tissues all respond to antiandrogens with a decrease in absolute weight. The 
primary model for the Hershberger bioassay has been the surgically castrated pcripubcrtal male, which was 
validated in Phases 1, 2 and 3 of the Hershberger validation program. 

3. The Hershberger bioassay serves as a mechanistic in vivo screening assay for androgen agonists, 
androgen antagonists and 5a-rcductase inhibitors and its application should be seen in the context of the 
■‘OECD Conceptual Framework for the Testing and Assessment of Endocrine Disrupting Chemicals” 
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(Annex 2). In this Conceptual Framework the Hershberger Bioassay is contained in Level 3 as an in vivo 
assay providing data about a single endocrine meehanism, i.e. (anti)androgenieity. It is intended to be 
included in a battery of in vitro and in vivo tests to identify substances with potential to interact with the 
endocrine system, ultimately leading to hazard and risk assessments for human health or the environment. 

4. Due to animal welfare concerns with the castration procedure, the intact (uncastrated) stimulated 
w'eanling male w'as sought as an alternative model for the Hershberger Bioassay to avoid the castration 
step. The stimulated w'eanling test method was validated (24); however, in the validation studies, the 
weanling version of the Hershberger Bioassay did not appear to be able to consistently detect effects on 
androgen-dependent organ weights from weak anti-androgens at the doses tested. Therefore, it was not 
included in this Test Guideline. However, recognising that its use may provide not only animal welfare 
benefits but also mav provide information on other modes of action, it is available in Guidance Document 
115. 

INITIAL CONSIDERATIONS AND LIMITATIONS 

5 . Androgen agonists and antagonists act as ligands for the androgen receptor and may activate or 
inhibit, respectively, gene transcription controlled by the receptor. In addition, some chemicals inhibit the 
conversion of testosterone to the more potent natural androgen dihydrotestosterone in some androgen 
target tissues (5a-reductase inhibitors). Such substances have the potential to lead to adverse health 
hazards, including reproductive and developmental effects. Therefore, the regulatoiy' need exists to rapidly 
assess and evaluate a chemical as a possible androgen agonist or antagonist or 5a-rcductasc inhibitor. 
While informative, the affinity of a ligand for an androgen receptor as measured by receptor binding or 
transcriptional activation of reporter genes in vitro is not the only determinant of possible hazard. Other 
determinants include metabolic activation and deactivation upon entering the body, substance distribution 
to target tissues, and clearance from the body. This leads to the need to screen the possible activity of a 
chemical in vivo under relevant conditions and exposure. In vivo evaluation is less critical if the chemical’s 
characteristics regarding Absorption - Distribution - Metabolism - Elimination (ADME) are known. 
Androgen-dependent tissues respond with rapid and vigorous growth to stimulation by androgens, 
particularly in castrate-peripubertal male rats. Rodent species, particularly the rat, are also w'idely used in 
toxicity studies for hazard characterization. Therefore, the assay version using the castrated peripubertal 
rat and the five target tissues in this assay are appropriate for the in vivo screening of androgen agonists 
and antagonists and 5a-reductase inhibitors. 

6. This Test Guideline is based on those protocols employed in the OECD validation study w'hich 
have been shown to be reliable and reproducible in intra- and interlaboratory studies (4) (5) (6) (7) (8). 
Both androgen and antiandrogen procedures are presented in this Guideline. 

7. Although there was some variation in the dose of TP used to detect antiandrogens in the OECD 
Hershberger Bioassay validation programme by the different laboratories (0.2 versus 0.4 mg/kg/d, 
subcutaneous injection) there was little difference between these two protocol variations in the ability to 
detect weak or strong antiandrogenic activity. However, it is clear that the dose of TP should not be too 
high as to block the effects of weak androgen receptor (AR) antagonists or so low' that the androgenic 
tissues display little growth response even without antiandrogen coadministration. 

8. The growth response of the individual androgen-dependent tissues is not entirely of androgenic 
origin, i.e. compounds other than androgen agonists can alter the w'eight of certain tissues. However, the 
growth response of several tissues concomitantly substantiates a more androgen-specific mechanism. For 
example, high doses of potent estrogens can increase the weight of the seminal vesicles; however, the other 
androgcn-dcpcndcnt tissues in the assay do not respond in a similar manner. Antiandrogenic chemicals can 
act cither as androgen receptor antagonists or 5a-reductase inhibitors. 5a-reductase inhibitors have a 
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variable effect, because the conversion to more potent dihydrotcstostcronc varies by tissue. Antiandrogens 
that inhibit 5a-reductase, like finasteride, have more pronounced effects in the ventral prostate than other 
tissues as compared to a potent AR antagonist, like flutamide. This difference in tissue response can be 
used to differentiate between AR mediated and 5a-reductase mediated modes of action. In addition, the 
androgen receptor is evolutionarily related to that of other steroid hormones, and some other hormones, 
when administered at high, supraphysiological dosage levels, can bind and antagonize the grov\th- 
promoting effects of TP (13). Further, it also is plausible that enhanced steroid metabolism and a 
consequent lowering of scrum testosterone could reduce androgen-dependent tissue growth. Therefore, any 
positive outcome in the Hershberger Bioassay should normally be evaluated using a weight of evidence 
approach, including in vitro assays, such as the AR and estrogen receptor (ER) binding assays and 
corresponding transcriptional activation assays, or from other in vivo assays that examine similar androgen 
target tissues such as the male pubertal assay, 15-day intact adult male assay, or 28-day or 90-day repeat 
dose studies. 

9. Experience indicates that xenobiotic androgens are rarer than xenobiotic antiandrogens. The 
expectation then is that the Hershberger bioassay will be used most often for the screening of 
antiandrogens. However, the procedure to test for androgens could, nevertheless, be recommended for 
steroidal or steroid-like chemicals or for chemicals for which an indication of possible androgenic effects 
was derived from methods contained in Level 1 or 2 of the conceptual framework (Annex 2). Similarly, 
adverse effects associated with (anti)androgenic profiles may be observed in Level 5 assays, leading to the 
need to assess whether a substance operates by an endocrine mode of action. 

10. It is acknowledged that all animal-based procedures should conform to local standards of animal 
care; the descriptions of care and treatment set forth below are minimal performance standards, and will be 
superseded by local regulations. Further guidance of the humane treatment of animals is given by the 
OECD (17). 

11. As in any bioassay using experimental animals, careful considerations should be given to the 
necessity to carry out this study. Basically there may be two reasons for such a decision: 

• high exposure potential (Level 1 of the Conceptual Framew'ork) or indications for 
(anti)androgenicity in in vitro assays (Level 2) supporting investigations whether such effects may 
occur in vivo; 

• effects consistent with (anti)androgenicity in Level 4 or 5 in vivo tests supporting investigations of 
the specific mode of action, e.g., to determine w'hetherthe effects were due to an (anti)androgenic 
mechanism. 

12. Definitions used in this Test Guideline are given in Annex 1 . 

PRINCIPLE OF THE TEST 

13. The Hershberger Bioassay achieves its sensitivity by using males with minimal endogenous 
androgen production. This is achieved through the use of castrated males provided an adequate time after 
castration for the target tissues to regress to a minimal and uniform baseline weight is allowed. Thus, when 
screening of potential androgenic activity, there are low endogenous levels of circulating androgens, the 
hypothalamic - pituitary - gonad axis is rendered unable to compensate via feedback mechanisms, the 
ability of the tissues to respond is maximized, and the starting tissue weight variability is minimized. 
When screening of potential anti-androgenic activity, a more consistent tissue weight gain can be achieved 
when the tissues are stimulated by a reference androgen. As a result, the Hershberger Bioassay requires 
only 6 animals per dose group whereas other assays with intact pubertal or adult males suggest using 15 
males per dose group. 


©OCDE, (2009) 



441 


OECD/OCDE 


14. Castration of pcripubcrtal male rats should be done in an appropriate manner using approved 
anestheties and aseptie technique. Analgesics should be administered on the first few days following 
surgery to eliminate post-surgical discomfort. Castration enhances the precision of the assay to detect weak 
androgens and antiandrogens by eliminating compensatoiy endocrine feed-back mechanisms present in the 
intact animal that can attenuate the effects of administered androgens and antiandrogens and by eliminating 
the large inter-individual variability in semm testosterone levels. Hence, castration reduces the numbers of 
animals required to screen for these endocrine activities. 

15. When screening for potential androgenic activity, the test substance is administered daily by oral 
gavage or subcutaneous (sc) injection for a period of ten consecutive days. Test substances are 
administered to a minimum of two treatment groups of experimental animals using one dose level per 
group. The animals are necropsied approximately 24 hours after the last dose. A statistically significant 
increase in two or more target organ weights of the test substance groups compared to the vehicle control 
group indicates that the test substance is positive for potential androgenic activity (See paragraph 60). 
Androgens, like trenbolone that cannot be 5a-reduced have more pronounced effects on the LABC and 
GP versus TP. but all tissues should display increased growth. 

16. When screening for potential antiandrogcnic activity, the test substance is administered daily by 
oral gavage or subeutaneous injection for a period of ten consecutive days in eoncert with daily TP doses 
(0.2 or 0.4 mg/kg/d) by sc injection. It was determined in the validation programme that either 0.2 or 0.4 
mg/kg/d of TP could be used as both were effective in the detection of antiandrogens and. therefore, only 
one dose should be selected for use in the assay. Graduated test substance doses are administered to a 
minimum of three treatment groups of experimental animals using one dose level per group. The animals 
are necropsied approximately 24 hours after the last dose. A statistically significant decrease in tw'O or 
more target organ weights of the test substance plus TP groups compared to the TP only control group 
indicates that the test substance is positive for potential antiandrogenic activity (See paragraph 61). 

DESCRIPTION OF THE METHOD 

Selection of species and strain 

17. The rat has been routinely used in the Hershberger Bioassay since the 1930s. Although it is 
biologically plausible that both the rat and mouse w'ould display similar responses, based upon 70 years of 
experience with the rat model, the rat is the species of choice for the Hershberger Bioassay. In addition, 
since Hershberger Bioassay data may be preliminaiy to a long-term multigenerational study, this allows 
animals from the same species, strain and source to be used in both studies. 

18. This protocol allows laboratories to select the strain of rat to be used in the assay which should 
generally be that used historically by the participating laboratory. Commonly used laboratory rat strains 
may be used; however, strains that mature significantly later than 42 days of age should not be used since 
castration of these males at 42 days of age could preclude measurement of glans penis weights, which can 
only be done after the prepuce is separated from the penile shaft. Thus, strains derived from the Fisher 344 
rat should not be used, except in rare cases. The Fisher 344 rat has a different timing of sexual 
development compared with other more commonly used strains such as Sprague Daw'ley or Wistar strains 
(16). If such a strain is to be used, the laboratoiy' should castrate them at a slightly older age and be able to 
demonstrate the sensitivity of the strain used. The rationale for the choice of rat strain should be clearly- 
stated by the laboratory. Where the screening assay may be preliminary to a repeated dose oral study, a 
reproductive and developmental study, or a long-term study, preferably animals from the same strain and 
source should be used in all studies. 
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Housing and feeding conditions 

19. All procedures should conform to all local standards of laboratory animal care. These 
descriptions of care and treatment are minimum standards and will be superseded by more stringent local 
regulations, when present. The temperature in the experimental animal room should be 22°C (with an 
approximate range ± 3°C). The relative humidity should be a minimum of 30% and preferably should not 
exceed a maximum 70%, other than during room cleaning. The aim should be relative humidity of 50- 
60%. Lighting should be artificial. The daily lighting sequence should be 12 hours light, 12 hours dark. 

20. Group housing is preferable to isolation because of the young age of the animals and the fact that 
rats are social animals. Housing of two or three animals per cage avoids crow'ding and associated stress 
that may interfere with the hormonal control of the development of the sex accessory' tissue. Cages should 
be thoroughly cleaned to remove possible contaminants and arranged in such a way that possible effects 
due to cage placement are minimized. Cages of a proper size (-2000 square centimeters) will prevent 
overcrowding. 

21. Each animal should be identified individually (e.g., ear mark or tag) using a humane method. 
The method of identification should be recorded. 

22. Laboratory' diet and drinking water should be provided ad libitum. Laboratories executing the 
Hershberger Bioassay should use the laboratory' diet normally used in their chemical testing w'ork. In the 
validation studies of the Bioassay, no effects or variability' were observed that were attributable to the diet. 
The diet used will be recorded and a sample of the laboratory' diet should be retained for possible future 
analysis. 

Performance Criteria for androgen-dependent organ weights 

23. During the validation study, there was no evidence that a decrease in body weight affected 
increases or decreases in the growth of tissue w'eights for target tissues (i.e., that should be weighted in this 
study). 

24. Among the different strains of rat used successfully in the validation programme, androgen- 
dependent organ weights are larger in the heavier rat strains than in the lighter strains. Therefore, the 
Hershberger Bioassay performance criteria do not include absolute expected organ weights for positive and 
negative controls. 

25. Because the Coefficient of Variation (CV) for a tissue has an inverse relationship w ith statistical 
power, the Hershberger Bioassay performance criteria are based on maximum CV values for each tissue 
(Table 1). The CVs are derived from the OECD validation studies. In the case of negative outcomes, 
laboratories should examine the CVs from the control group and the high dose treatment group to 
determine if the maximum CV performance criteria have been exceeded. 

26. The study should be repeated when: 1) three or more of the ten possible individual CVs in the 
control and high dose treatment groups exceed the maximums designated for agonist and antagonist studies 
in Table 1 and 2) at least two target tissues were marginally insignificant, i.e., p values between 0.05 and 
0 . 10 . 

Table 1: Maximum allowable CV.s Determined for the Target Sex Accessory Tissues for the 

castrate model in the OECD Validation Studies'. 


' The threshold CV for a given tissue was identified from a graph of CV values - arranged from smallest 
sequentially to largest - for all means from all experiments in the validation exercise using a specific model (agonist 
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Tissue 

Antiandrogenic effects 

Androgenic effects 

Seminal vesicles 

40% 

40% 

Ventral prostate 

40% 

45% 

LABC 

20% 

30% 

Cowper.s glands 

35% 

55% 

Gians penis 

17% 

22% 


PROCEDURE 

Regulatory compliance and laboratory verification 

27. Unlike the utcrotrophic assay (TG 440), a demonstration of laboratory' competenec prior to the 
initiation of the study is not necessary' for the Hershberger assay because concurrent positive (Testosterone 
Propionate and Flutamide) and negative controls are run as an integral part of the assay. 

Number and condition of animals 

28. Eaeh treated and control group should include a minimum of 6 animals. This applies to both the 
androgenic and antiandrogenic protocols. 

Castration 

29. There should be an initial acclimatization period of several days after receipt of the animals to 
ensure that the animals are healthy and thriving. Since animals castrated before 42 days of age or postnatal 
day (pnd) 42 may not display preputial separation, animals should be castrated on pnd 42 or thereafter, not 
before. The animals arc castrated under anesthesia by placing an incision in the scrotum and removing 
both testes and epididy'mides with ligation of blood vessels and seminal ducts. After confirming that no 
bleeding is occurring, the scrotum should be closed with suture or autoclips. Animals should be treated 
with analgesics for the first few days after surgery' to alleviate any post-surgical discomfort. If castrated 
animals are purchased from an animal supplier, the age of animals and stage of sexual maturity should be 
assured by the supplier. 

Acclimatization after castration 

30. The animals should continue acclimation to the laboratory' conditions to allow for the regression 
in the target tissue weights for a minimum of 7 days following castration. Animals should be observed 
daily, and any animals with evidence of disease or physical abnormalities should be removed. Thus, 
treatment with initiation of dosing (on study) may commence as early as pnd 49 days of age, but not later 
than pnd 60. Age at necropsy should not be greater than pnd 70. This flexibility allows a laboratory' to 
schedule the experimental work efficiently. 


or antagonist). The threshold CV was read from the point at which the increments between to the next highest CVs 
m the series are dramatically larger than the preceding few CV.s- the “breakpoint”. It should be noted that although 
this analysis identified relatively reliable “breakpoints” for the antagonist model of the assay, CV eurves for the 
agonist assay showed a more uniform increase making identifieation of a threshold CV by this method somewhat 
arbitrary'. 
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Body weight and group randomization 

3 1 . Differences in individual body weights are a source of variability in tissue w'eights both within 
and among groups of animals. Increasing tissue weight variability' results in an increased coefficient of 
variation (CV) and decreases the statistical power of the assay (sometimes referred to as assay sensitivity). 
Therefore, variations in body weight should be both experimentally and statistically controlled. 

32. Experimental control involves producing small variations in body weight within and among the 
study groups. First, unusually small or large animals should be avoided and not placed in the study cohort. 
At study commencement the weight variation of animals used should not exceed ± 20 % of the mean 
weight (e.g. 175g ± 35g for castrated peripubertal rats). Second, animals should be assigned to groups 
(both control and treatment) by randomized w'eight distribution, so that mean body w eight of each group is 
not statistically different from any other group. The block randomization procedure used should be 
recorded. 

33. Because toxicity may decrease the body weight of treated groups relative to the control group, the 
body W'eight on the first day of test substance administration could be used as the statistical covariate, not 
the body w'eight at necropsy. 

Dosage 

34. In order to establish whether a test substance can have androgenic action in vivo, two dose groups 
of the test substance plus positive and vehicle (negative) controls (See paragraph 3) are normally sufficient, 
and this design is therefore preferred for animal W'elfare reasons. If the purpose is either to obtain a dose- 
response curve or to extrapolate to low'cr doses, at least 3 dose groups arc needed. If information beyond 
identification of androgenic activity (such as an estimate of potency) is required, a different dosing regime 
should be considered. To test for antiandrogens, the test substance is administered together w'ith a reference 
androgen agonist. A minimum of 3 test groups with different doses of the test chemical and a positive and 
a negative control (See paragraph 44) should be used. Except for treatment with the test substance, 
animals in the control group should be handled in an identical manner to the test group subjects. If a 
vehicle is used in administering the test substance, the control group should receive the vehicle in the 
highest volume used w'ith the test groups. 

35. All dose levels should be proposed and selected taking into account any existing toxicity and 
(toxico-) kinetic data available for the test substance or related materials. The highest dose level should 
first take into consideration the LD 50 and/or acute toxicity information in order to avoid death, severe 
suffering or distress in the animals (17) (18) (19) (20) and, second, take into consideration available 
information on the doses used in subchronic and chronic studies. In general, the highest dose should not 
cause a reduction in the final body weight of the animals greater than 10% of control weight. The highest 
dose should be either 1 ) the highest dose that ensures animal survival and that is without significant 
toxicity or distress to the animals after 10 consecutive days of administration up to a maximal dose of 1000 
mg/kg/day (See paragraph 36) or 2) a dose inducing (anti)androgenic effects, whichever is lower. As a 
screen, large intervals, e.g., one half log units (corresponding to a dose progression of 3.2) or even one log 
units, between dosages arc acceptable. If there arc no suitable data available, a range finding study (See 
paragraph 37) may be performed to aid the determination of the doses to be used. 

Limit dose level 

36. If a test at the limit dose of 1000 mg/kg body weight/day and a low'cr dose using the procedures 
described for this study, fails to produce a statistically significant change in reproductive organ w'eights. 
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then additional dose levels may be eonsidcred unneeessary. The limit dose applies except when human 
exposure data indicate the need for a higher dose level to be used. 

Considerations for range finding 

37. If necessary, a preliminary range finding study can be carried out with a few animals to select the 
appropriate dose groups [modified OECD guidelines for acute toxicity testing (TG 420, TG 423, TG 425)]. 
The objective in the case of the Hershberger Bioassay is to select doses that ensure animal survival and that 
are without significant toxicity or distress to the animals after ten consecutive days of chemical 
administration up to a limit dose of 1000 mg/kg/d as noted in paragraphs 35 and 36. In this respect an 
OECD Guidance Document (17) may be used defining clinical signs indicative of toxicity or distress to the 
animals. If feasible within this range finding study after ten days of administration, the target tissues may- 
be excised and w'eighed approximately 24-hours after the last dose is administered. These data could then 
be used to assist the selection of the doses in the main study. 

Reference substances and vehicle 

38. The reference androgen agonist should be Testosterone Propionate (TP), CAS No 57-82-5. The 
reference TP dosage may be either 0.2 mg/kg-b\v/d or 0.4 mg/kg-bw/d. The reference androgen antagonist 
should bo Flutamidc (FT), CAS No 13 1 1-84-7. The reference FT dosage should be 3 mg/kg-bw/d, and the 
FT should be coadministered with the reference TP dosage. 

39. It is recommended that, wherever possible, the use of an aqueous solution/suspension be 
considered first. However, since many androgen ligands or their metabolic precursors tend to be 
hydrophobic, the most common approach is to use a solution/suspension in oil (c.g. com, peanut, sesame or 
olive oil). Test substances can be dissolved in a minimal amount of 95% ethanol or other appropriate 
solvents and diluted to final working concentrations in the test vehicle. The toxic characteristics of the 
solvent should be knowm. and should be tested in a separate solvent-only control group. If the test 
substance is considered stable, gentle heating and vigorous mechanical action can be used to assist in 
dissolving the test substance. The stability of the test substance in the vehicle should be determined. If the 
test substance is stable for the duration of the study, then one starting aliquot of the test substance may be 
prepared, and the specified dosage dilutions prepared daily using care to avoid contamination and spoilage 
of the samples. 

Administration of doses 

40. TP should be administered by subcutaneous injection, and FT by oral gavage. 

41. The test substance is administered by oral gavage or subcutaneous injection. Animal welfare 
considerations and the physical/chemical properties of the test substance need to be taken into account 
w'hen choosing the route of administration. In addition, toxicological aspects like the relevance to the 
human route of exposure to the chemical (e.g. oral gavage to model ingestion, subcutaneous injection to 
model inhalation or dermal adsorption) and existing toxicological information and data on metabolism and 
kinetics (e.g. need to avoid first pass metabolism, better efficiency via a particular route) should be taken 
into account before extensive, long-term testing is initiated if positive results are obtained by injection. 

42. The animals should be dosed in the same manner and time sequence for ten consecutive days at 

approximately 24 hour intervals. The dosage level should be adjusted daily based on the concurrent daily- 
measures of body weight. The volume of dose and time that it is administered should be recorded on each 
day of exposure. Care should be taken in order not to exceed the maximum dose described in paragraph 35 
to allow a meaningful interpretation of the data. Reduction of body weight, clinical signs, and other 
findings should be thoroughly- assessed in this respect. For oral gavage, a stomach tube or a suitable 
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intubation cannula should be used. The maximum volume of liquid that can be administered at one time 
depends on the size of the test animal. Local animal care guidelines should be followed, but tbe volume 
should not exceed 5 ml/kg body weight, except in the case of aqueous solutions where 10 ml/kg body- 
weight may be used. For subcutaneous injections, doses should be administered to the dorsoscapular and 
or lumbar regions via sterile needle (e.g. 23- or 25-gauge) and a tuberculin syringe. Shaving the injection 
site is optional. Any losses, leakage at the injection site or incomplete dosing should be reeorded. The 
total volume injected per rat per day should not exceed 0.5 ml/kg body weight. 

Specific procedures for androgen agonists 

43 . For the test for androgen agonists, the vehicle is the negative control, and the TP-treated group is 
the positive control. Biological activity consistent with androgen agonists is tested by administering a test 
substance to treatment groups at tbe selected doses for 10 eonsecutive days. The weights of the five sex 
accessory tissues from the test substance groups are compared to the vehicle group for statistically 
significant increases in weight. 

Specific procedures for androgen antagonists and 5a-reductase inhibitors 

44. For the test for androgen antagonists and 5a-reductase inhibitors, the TP-treated group is the 
negative control, and the group coadministered with reference doses of TP and FT is the positive control. 
Biological activity consistent with androgen antagonists and 5a-reductasc inhibitors is tested by- 
administering a reference dose of TP and administering the test substanee for 10 eonsecutive days. The 
weights of the five sex aecessory- tissues from the TP plus test substance groups are compared to the 
reference TP-only group for statistically significant decreases in w-eights. 

OBSERVATIONS 

Clinical observations 

45 . General clinical observations should be made at least once a day and more frequently w-hen signs 
of toxicity are observed. Observations should be carried out preferably at the same time(s) each day and 
considering the period of anticipated peak effects after dosing. All animals should be observed for 
mortality, morbidity- and general clinical signs such as changes in behaviour, skin, fur, eyes, mucous 
membranes, occurrence of secretions and excretions and autonomic activity- (e.g. lacrimation, piloerection, 
pupil size, unusual respiratory' pattern). 

46. Any animal found dead should be removed and disposed of w'ithout further data analysis. Any- 
mortality of animals prior to necropsy should be included in the study record together with any apparent 
reasons for mortality. Any moribund animals should be humanely terminated. Any moribund and 
subsequently euthanized animals should be included in the study- record with apparent reasons for 
morbidity-. 

Body weight and food consumption 

47. All animals should be weighed daily to the nearest 0. 1 g, starting just prior to initiation of 
treatment i.c., w'hen the animals are allocated into groups. As an optional measurement, the amount of food 
consumed during the treatment period may be measured per eage by- w-eighing the feeders. The food 
consumption results should be expressed in grams per rat per day. 
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Dissection and measurement of tissue and organ weights 

48. Approximately 24 hours after the last administration of the test substance, the rats should be 
euthanized and exsanguinated according to the normal procedures of the conducting laboratory', and 
necropsy carried out. The method of humane killing should be recorded in the laboratory report. 

49. Ideally, the necropsy order should be randomized across groups to avoid progression directly up 
or down dose groups that could affect the data. Any finding at necropsy, i.e., pathological changes/visible 
lesions should be noted and reported. 

50. The five androgen-dependent tissues (VP, SV, LABC, COW, GP) should be weighted. These 
tissues should be excised, carefully trimmed of excess adhering tissue and fat, and their fresh (unfixed) 
weights determined. Each tissue should be handled with particular care to avoid the loss of fluids and to 
avoid desiccation, which may introduce significant errors and variability by decreasing the recorded 
weights. Several of the tissues may be very' small or difficult to dissect, and this will introduce variability. 
Therefore, it is important that persons carry'ing out the dissection of the sex accessory tissues are familiar 
with standard dissection procedures for these tissues. A standard operating procedure (SOP) manual for 
dissection is available from the OECD (21). Careful training according to the SOP guide will minimize a 
potential source of variation in the study. Ideally the same prosector should be responsible for the 
dissection of a given tissue to eliminate inter-individual differences in tissue processing. If this is not 
possible, the necropsy should be designed such that each prosector dissects a given tissue from all 
treatment groups as opposed to one individual dissecting all tissues from a control group, while someone 
else is responsible for the treated groups. Each sex accessory' tissues should be weighed without blotting to 
the nearest 0. 1 mg, and the weights recorded for each animal. 

5 1 . Several of the tissues may be very small or difficult to dissect, and this will introduce variability. 
Previous work has indicated a range of coefficient of variations (CVs) that appears to differ based upon the 
proficiency of the laboratory. In a few cases, large differences in the absolute weights of the tissues such 
as the VP and COWS have been observed within a particular laboratory'. 

52. Liver, paired kidney, and paired adrenal weights are optional measurements. Again, tissues 
should be trimmed free of any adhering fascia and fat. The liver should be weighed and recorded to the 
nearest 0. 1 g, and the paired kidneys and paired adrenals should be weighed and recorded to the nearest 0.1 
mg. The liver, kidney and adrenals are not only influenced by androgens; they also provide useful indices 
of systemic toxicity. 

53. Measurement of serum luteinising hormone (LH), follicular stimulating hormone (FSH) and 
testosterone (T) is optional. Scrum T levels arc useful to determine if the test substance induces liver 
metabolism of testosterone, low'ering serum levels. Without the T data, such an effect might appear to be 
via an antiandrogenic mechanism. LH levels provides information about the ability of an antiandrogen to 
not only reduce organ weights, but also to affect hypothalamic-pituitary function, which in long term 
studies can induce testis tumors. FSH is an important hormone for spermatogenesis. Serum T4 and T3 also 
arc optional measures that would provide useful supplemental information about the ability to disrupt 
thyroid honnonc homeostasis. If hormone measurements arc to be made, the rats should be anesthetized 
prior to necropsy and blood taken by cardiac puncture, and the method of anesthesia should be chosen with 
care so that it does not affect hormone measurement. The method of serum preparation, the source of 
radioimmunoassay or other measurement kits, the analytical procedures, and the results should be 
recorded. LH levels should be reported as ng per ml of seram, and T should also be reported as ng per ml 
of serum. 
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54. The dissection of the tissues is described as follows with a detailed dissection guide with 
photographs published as supplementarj- materials as part of the validation programme (21). A dissection 
video is also available from the Korea Food and Drug Administration web page (22). 


• With the ventral surface of the animal upw ards, determine if the prepuce of the penis has 
separated from the glans penis. If so, then retract the prepuce and remove the glans penis, weigh 
(nearest 0.1 mg), and record the weight; 

• Open the abdominal skin and wall, exposing the viscera. If the optional organs are weighed, 
remove and weigh liver to nearest 0. 1 g, remove the stomach and intestines, remove and weigh 
the paired kidneys and paired adrenals to the nearest 0.1 mg. This dissection exposes the 
bladder and begins the dissection of the target male accessoiy tissues. 

• To dissect the VP, separate bladder from the ventral muscle layer by cutting connective 
tissue along the midline. Displace the bladder anteriorly towards the seminal vesicles 
(SV), revealing the left and right lobes of the ventral prostate (covered by a layer of fat). 
Carefully tease the fat from the right and left lobes of the VP. Gently displace the VP 
right lobe from the urethra and dissect the lobe from the urethra. While still holding the 
VP right lobe, gently displace the VP left lobe from the urethra and then dissect; weigh to 
nearest 0. 1 mg and record the weight. 

• To dissect the SVCG, displace the bladder caudally, exposing the vas deferens and right and left 
lobes of the seminal vesicles plus coagulating glands (SVCG). Prevent leakage of fluid b\- 
clamping a hemostat at the base of the SVCGs, where the vas deferens joins the urethra. 
Carefully dissect the SVCGs, w'ith the hemostat in place trim fat and adnexa aw'ay, place 
in a tared weigh-boat, remove the hemostat, and weigh to the nearest 0.1 mg and record the 
w'eight. 

• To dissect the levator ani plus bulbocavemosus muscles (LABC), the muscles and the base of the 
penis are exposed. The LA muscles w'rap around the colon, while the anterior LA and BC 
muscles arc attached to the penile bulbs. The skin and adnexa from the perianal region extending 
from the base of the penis to the anterior end of the anus arc removed. The BC muscles arc 
gradually dissected from the penile bulb and tissues. The colon is cut in two and, the full 
LABC can be dissected and removed. The LABC should be trimmed of fat and adnexa, 
w'eighed to the nearest 0. 1 mg, and record the weight. 

• After the LABC has been removed, the round Cowper's or bulbourethral glands (COW) 
are visible at the base of, and slightly dorsal to, the penile bulbs. Careful dissection is required to 
avoid nicking the thin capsule in order to prevent fluid leakage. Weigh the paired COW to the 
nearest 0. 1 mg, and record the weight. 

• In addition, if fluid is lost from any gland during the necropsy and dissection, this should be 
recorded. 

55. If the evaluation of each chemical requires necropsy of more animals than is reasonable for a 
single day, the study start may be staggered on two consecutive days, resulting in the staggering of the 
necropsy and the related work over tw'o days. If staggered in this manner, one-half of the animals per 
treatment group should be used per day. 

56. Carcasses should be disposed of in an appropriate manner following necropsy. 
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REPORTING 

Data 

57. Data should be reported individually (i.e. body weight, accessory' sex tissue weights, optional 
measurements and other responses and observations) and for each group of animals (means and standard 
deviations of all measurement taken). The data should be summarized in tabular form. The data should 
show the number of animals at the start of the test, the number of animals found dead during the test or 
found showing signs of toxicity, a description of the signs of toxicity observed, including time of onset, 
duration and severity. 


58. A final report should include : 

Testing facility 

- Name of facility, location 

- Study director and other personnel and their study responsibilities 

- Dates the study began and ended, i.e., first day of test substance administration and last day of 
necropsy, respectively. 

Test substance 

- Source. lotTbatch number, identity, purity, full address of the supplier and characterization of the 
test substanee(s) 

- Physical nature and, where relevant, physicochemical properties; 

- Storage conditions and the method and frequency of dilution preparation 

- Any data generated on stability 

- Any analyses of dosing solutions/suspensions. 


Vehicle 


- Characterization of the vehicle (identity, supplier and lot #) 

- Justification of the vehicle choice (if other than water) 


Test animals and animal husbandry procedures 

- Species/strain used and rationale for choice 

- Source or supplier of animals, including full address 

- Number and age of animals supplied 

- Housing conditions (temperature, lighting, and so on) 

- Diet (name, ty'pc, supplier, lot number, content and if known, phytocstrogens levels) 

- Bedding (name, type, supplier, content) 

- Caging conditions and number of animals per cage; 

Assay Conditions 

- Age at castration and duration of acclimatization after eastration; 

- Individual weights of animals at the start of the study (to nearest 0.1 g); 

- Randomization process and a record of the assignment to vehicle, reference, test 
substance groups, and cages 
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- Mean and standard deviation of the body weights for caeh group for eaeh weigh day throughout 
the study; 

- Rationale for dose selection 

- Route of administration of test substance and rationale for the choice of exposure route 

- If an assay for antiandrogenicity, the TP treatment (dose and volume), 

- Test substanee treatment (dose and volume), 

- Time of dosing 

- Necropsy procedures, including means of exsanguinations and any anesthesia 

- If serum analyses are performed, details of the method should be supplied. For example, if RIA 
is used, the RIA procedure, source of RIA kits, kit expiration dates, procedure for scintillation 
counting, and standardization should be reported. 


Results 


- Daily observations for each animal during dosing, including: 

- Body weights (to the nearest 0.1 g), 

- Clinical signs (if any), 

- Any measurement or notes of food consumption. 

- Necropsy observations for each animal, including: 

- Date of necropsy, 

- Animal treatment group, 

- Animal ID. 

- Prosector, 

- Time of day necropsy and dissection are performed, 

- Animal age, 

- Final body weight at necropsy, noting any statistically significant increase or decrease, 

- Order of animal exsanguination and dissection at necropsy, 

- Weights of the five target androgen dependent tissues: 

- Ventral prostate (to the nearest 0.1 mg) 

- Seminal vesicles plus coagulating glands, including fluid (paired, to nearest 0.1 
mg) 

- Levator ani plus bulbocaveraosus muscle complex (to nearest 0.1 mg) 

- Cowper’s glands (fresh weight - paired, to nearest 0. 1 mg). 

- Gians penis (fresh weight to nearest 0. 1 mg), 

- Weights of optional tissues, if perfonned: 

- Liver (to nearest 0. 1 g) 

- Kidney (paired, to nearest 0.1 mg) 

- Adrenal (paired, to nearest 0. 1 mg) 

- General remarks and comments 

- Analyses of serum hormones, if performed. 

- Serum LH (optional - ng per ml of serum), and 

- Scrum T (optional - ng per ml of scrum) 

- General remarks and comments 


Data summarization 

Data should be summarized in tabular form containing the sample size for each group, the mean of the 
value, and the standard error of the mean or the standard deviation. Tables should include necropsy body 
weights, body weight changes from the beginning of dosing until necropsy, target accessory' sex tissues 
weights, and any optional organ weights. 
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Discussion of the results 
Analysis of results 

59. Necropsy body and organ weights should be statistically analyzed for characteristics such as 
homogeneity of variance with appropriate data transformations as needed. Treatment groups should be 
compared to a control group using techniques such as ANOVA followed by pairwise comparisons (e.g. 
Dunnett’s one tailed test) and the criterion for statistical difference, for example, p< 0.05. Those groups 
attaining statistical significance should be identified. However, "relative organ" weights should be avoided 
due to the invalid statistical assumptions underlying this data manipulation. 

60. For androgen agonism, the control should be the vehicle-only test group. The mode of action 
characteristics of a test substance can lead to different relative responses amongst the tissues, for example 
trenbolone, which cannot be 5 alpha-reduced, has more pronounced effects on the LABC and GP than does 
TP. A statistically significant increase (p< 0.05) in any two or more of the five target androgen-dependent 
tissue weights (VP, LABC, GP, CG and SVCG) should be considered a positive androgen agonist result, 
and all the target tissues should display some degree of increased growth combined evaluation of all 
accessory sex organs (ASO) tissue responses could be achieved using appropriate multivariate data 
analysis. This could improve the analysis, especially in cases where only a single tissue gives a statistically 
significant response. 

61. For androgen antagonism, the control should be the reference androgen (testosterone propionate 
only) test group. The mode of action characteristics of a test substance can lead to different relative 
responses amongst the tissues, for example 5 alpha a-reductase inhibitors, like finasteride, have more 
pronounced effects on the ventral prostate than other tissues as compared to potent AR antagonists, like 
flutamide. A statistically significant reduction (p < 0.05) in any tw'o or more of the five target androgen- 
dependent tissue weights (VP, LABC, GP, CG and SVCG) relative to TP treatment alone should be 
considered a positive androgen antagonist result and all the target tissues should display some degree of 
reduced growth. Combined evaluation of all ASO tissue responses could be achieved using appropriate 
multivariate data analysis. This could improve the analysis, especially in cases where only a single tissue 
gives a statistically significant response. 

62. Data should be summarized in tabular form containing the mean, standard error of the mean 
(standard deviation would also be acceptable) and sample size for each group. Individual data tables should 
also be included. The individual values, mean, SE (SD) and CV values for the control data should be 
examined to determine if they meet acceptable criteria for consistency with expected historical values. CVs 
that exceed CV values listed in Table 1 (see paragraphs 25 and 26) for each organ weight should determine 
if there arc errors in data recording or entry or if the laboratoiy has not yet mastered accurate dissection of 
the androgen-dependent tissues and further training/practice is w'arranted. Generally, CVs (the standard 
deviation divided by the mean organ weight) are reproducible from lab to lab and study to study. Data 
presented should include at least; ventral prostate, seminal vesicle, levator ani plus bulbocavernosus, 
Cowper's glands, glans penis, liver, and body weights and body weight change from the beginning of 
dosing until necropsy. Data also may be presented after covariance adjustment for body weight, but this 
should not replace presentation of the unadjusted data. In addition, if preputial separation (PPS) does not 
occur in any of the groups, the incidence of PPS should be recorded and statistically compared to the 
control group using Fisher Exact test. 

63. When verifying the computer data entries with the original data sheets for accuracy, organ weight 
values that are not biologically plausible or vary by more than three standard deviations from that 
treatment group means should be carefully scrutinized and may need to be discarded, likely being 
recording errors. 
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64. Comparison of study results with OECD CV values (in Table 1) is often an important step in 
interpretation as to the validity of the study results. Historieal data for vehiele control groups should be 
maintained in the laboratory. Historical data for responses to positive reference substances, such as TP and 
FT. should also be maintained in the laboratory'. Laboratories may also periodically test the response to 
known weak androgen agonists and antagonists and maintain these data. These data can be compared to 
available data OECD data to ensure that the laboratory’s methods yield sufficient statistical precision and 
power. 
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DEFINITIONS 


ANNEX 1 


Androgenic is a temi used to describe a positive influence on the growth of androgen-dependent tissues 
Antiandrogenic is the capability of a chemical to suppress the action of TP in a mammalian organism. 
Date of birth is postnatal day 0. 

Dose is the amount of test substance administered. For the Hershberger Bioassay, the dose is expressed as 
weight of test substance per unit body weight of test animal per day (e.g. mg/kg body weighriday). 

Dosage is a general term comprising of dose, its frequency and the duration of dosing. 


Moribund is a term used to describe an animal in a dying state, i.e., near the point of death. 
Postnatal day X is the Xth day of life after the day of birth. 


Sensitivity is the capability of a test method to correctly identify chemicals having the property that is 
being tested for. 

Specificity is the capability of a test method to correctly identify- chemicals not having the property that is 
being tested for. 


Validation is a scientific process designed to characterize the operational requirements and limitations of a 
test method and to demonstrate its reliability and relevance for a particular purpose. 
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ANNEX 2 


Note : Document prepared by the Secretariat of the Test Guidelines Programme based on the agreement reached at the 6th Meeting of the EDTA Task Force 

OECD Conceptual Framework for the Testing and Assessment of Endocrine Disrupting Chemicals 





Level 1 

Sorting & prioritization based 
upon existing information 

■ 

-physical & chemical piopeiiies, eg., MW, reactivity, volatility, biodegradability, 

- human & environmental exposure, e g., production volume, release, use patterns 

- hazard, e.g., available toxicological data 

1 








Level 2 

In vitro assays providing 
mechanistic data 

■ 

-ER.AR.tr receptor binding affinity -Higli Tliroiiglr Put Prescreens 

- Transcriptional activation - Thyroid flinctioii 

- Ai'omatase and steroidogenesis in vitro - Fish hepatocyte VTG assay 

- Aryl hydi'ocai'bon receptor recognition/binding - Others (as appropriate) 

- QSARs 





Level 3 

In vivo assays providing data 
about single endocrine 
Mechanisms and effects 

1 

- Uterotiopliic assay (estrogenic related) 

- Hershberger assay (audiogenic related) 

- Non -receptor mediated homione function 

- Others (e g. thyroid) 


- Fish VTG (vitellogeiun) assay 
(estrogenic related) 




Level 4 

In vivo assays providing data 
about multiple endocrine 
Mechanisms and effects 

■ 

- enlranced OECD 407 (endpoints based on 
endociine meclianisiiis) 

- male and female pubeital assays 

- adult intact male assay 


- Fish gonadal liistopatliology assay 

- Frog metamoiphosis assay 




Level 5 

In vivo assays providing data on 
effects from endocrine «& 
other mechanisms 


- 1 -generation assay ( 1 Ct 4 1 .7 enhanced)' 

- 2-generation assay (TG416 enhanced)^ 

- reproductive screening test (TG421 enlianced)’ 

- combined 28 day/reproduction screening test 

(TG 4 22 enhanced)’ 


- Paitial and fiitl life cycle assays 
in fish, birds, amphibians & 
invertebrates (developmental and 
I eproduction) 



VMG mamm; Validation Management Group on Mammalian Testing and Assessment 
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Notes to the Framework 


Note 1: Entering at all levels and exiting at all levels is possible and depends upon the 
nature of existing information needs for hazard and risk assessment purposes 

Note 2: In level 5,ecotoxicology should include endpoints that indicate mechanisms of 
adverse effects, and potential population damage 

Note 3: When a multimodal model covers several of the single endpoint assays, that 
model would replace the use of those single endpoint assays 

Note 4; The assessment of each chemical should be based on a case by case basis, taking 
into account all available information, bearing in mind the function of the framework 
levels. 

Note 5; The framework should not be considered as all inclusive at the present time. At 
levels 3,4 and 5 it includes assays that are either available or for which validation is under 
way. With respect to the latter, these are provisionally included. Once developed and 
validated, they will be formally added to the framework. 

Note 6: Level 5 should not be considered as including definitive tests only. Tests included 
at that level are considered to contribute to general hazard and risk assessment. 
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